Periodic distributions of hydrophobic amino acids allows the definition of fundamental building blocks to align distantly related proteins.

نویسندگان

  • J Baussand
  • C Deremble
  • A Carbone
چکیده

Several studies on large and small families of proteins proved in a general manner that hydrophobic amino acids are globally conserved even if they are subjected to high rate substitution. Statistical analysis of amino acids evolution within blocks of hydrophobic amino acids detected in sequences suggests their usage as a basic structural pattern to align pairs of proteins of less than 25% sequence identity, with no need of knowing their 3D structure. The authors present a new global alignment method and an automatic tool for Proteins with HYdrophobic Blocks ALignment (PHYBAL) based on the combinatorics of overlapping hydrophobic blocks. Two substitution matrices modeling a different selective pressure inside and outside hydrophobic blocks are constructed, the Inside Hydrophobic Blocks Matrix and the Outside Hydrophobic Blocks Matrix, and a 4D space of gap values is explored. PHYBAL performance is evaluated against Needleman and Wunsch algorithm run with Blosum 30, Blosum 45, Blosum 62, Gonnet, HSDM, PAM250, Johnson and Remote Homo matrices. PHYBAL behavior is analyzed on eight randomly selected pairs of proteins of >30% sequence identity that cover a large spectrum of structural properties. It is also validated on two large datasets, the 127 pairs of the Domingues dataset with >30% sequence identity, and 181 pairs issued from BAliBASE 2.0 and ranked by percentage of identity from 7 to 25%. Results confirm the importance of considering substitution matrices modeling hydrophobic contexts and a 4D space of gap values in aligning distantly related proteins. Two new notions of local and global stability are defined to assess the robustness of an alignment algorithm and the accuracy of PHYBAL. A new notion, the SAD-coefficient, to assess the difficulty of structural alignment is also introduced. PHYBAL has been compared with Hydrophobic Cluster Analysis and HMMSUM methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance of 2-Amino Tetraphenyl Porphyrin as Stationary Phase in RP-HPLC of Amino Acids

The search for new stationary phases has been one of the predominant concerns in high performance liquid chromatography (HPLC) in order to achieve better resolutions, longer column lives, and reduce the time of analysis. A chromatographic packing for separation of underivatized amino acids (AAs) were prepared by dynamically coating 2-amino tetraphenyl prophyrin (atpp) on a C-18 reversed-pha...

متن کامل

Genetic Analysis of Three Structural Proteins in Iranian Infectious Bronchitis Virus Isolate

Infectious bronchitis virus (IBV) is a contagious pathogen in fowl that results in economic loss in the poultry industry. In this study, the amino acids sequences of three structural proteins M, N, and S1 for five Iranian IBV isolated during 1998-2011 have been analyzed. Conserved and variable regions, hydrophobic characteristics and identity matrix were determined after alignment by Bioedit ve...

متن کامل

Thermodynamic-Biochemical Study of Complexes of Intermediate Elements with α-Amino Acids in Some Proteins with Active Site

In this paper, the quantum chemistry calculations related to the structural parameter of the chromite and molybdate anions and the complexes obtained from them with the glycine and alanine amino acids were performed. The calculations were carried out using HF and DFT methods and in the base series 6-31G *. Thermodynamic studies related to the formation of complexes have been considered and thei...

متن کامل

Inconsistent Distances in Substitution Matrices can be Avoided by Properly Handling Hydrophobic Residues

The adequacy of substitution matrices to model evolutionary relationships between amino acid sequences can be numerically evaluated by checking the mathematical property of triangle inequality for all triplets of residues. By converting substitution scores into distances, one can verify that a direct path between two amino acids is shorter than a path passing through a third amino acid in the a...

متن کامل

Application of Genetic Programming to Modeling and Prediction of Activity Coefficient Ratio of Electrolytes in Aqueous Electrolyte Solution Containing Amino Acids

Genetic programming (GP) is one of the computer algorithms in the family of evolutionary-computational methods, which have been shown to provide reliable solutions to complex optimization problems. The genetic programming under discussion in this work relies on tree-like building blocks, and thus supports process modeling with varying structure. In this paper the systems containing amino ac...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Proteins

دوره 67 3  شماره 

صفحات  -

تاریخ انتشار 2007